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CLAIMS 



1. (currently amended) A process for real time analysis of any of text 
and[[/or]] media content and relating information to the content, comprising the 
5 steps of: 

analyzing said content in real time; 

wherein said analyzing step analyzes said content for semantic and 
conceptual use; 

providing a set of i nformat i ona l reference documents; 
10 wherein said i nformat i ona l reference documents comprise any of text, 

Web, and media documents; 

providing a pre-processed analysis of said i nformat i ona l reference 
documents; 

wherein said pre-processed analysis is an analysis of said i nformat i ona l 
1 5 reference documents for semantic and conceptual use; 

identifying i nformat i ona l reference documents related to said analyzed 
content using said pre-processed analysis; 

providing a user with a description of each identified informat i ona l 
reference document; 

20 accepting user input for selecting an identified i nformat i ona l reference 

document; and 

displaying the selected identified informationa l reference document to the 

user. 

25 2. (currently amended) The process of Claim 1 , wherein said identifying step 
identifies related i nformationa l reference documents by finding i nformat i ona l 
reference documents that are similar in words, semantically or conceptually, to 
the analyzed content. 

30 3. (currently amended) The process of Claim 1, further comprising the step 
of: 
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storing descriptors for each i nformat i onal reference document, 
retrieving descriptions of each identified i nformationa l reference document 
from said stored descriptors. 

5 4. (currently amended) The process of Claim 1, wherein said set of 
i nf ormati o n al reference documents are stored in a central storage device. 

5. (currently amended) The process of Claim 1, wherein said pre-processed 
analysis creates a list of words and calculates the frequency that the words 

10 appear in said set of i nformat i ona l reference documents. 

6. (original) The process of Claim 5, wherein said pre-processed analysis 
translates similar words into the same word. 

15 7. (currently amended) The process of Claim 1, wherein said pre-processed 
analysis generates collocations of words that appear together and calculates the 
frequency of pairs of words and the frequency of the words appearing together in 
said informat i onal reference documents. 



20 8. (original) The process of Claim 7, wherein said pre-processed analysis 
finds relations between collocations to learn their meaning/context. 

9. (currently amended) The process of Claim 1, wherein said pre-processed 
analysis uses a signature algorithm to calculate signatures for blocks of text, 

25 wherein a signature is a vector of words and their weighting within a[[n]] 
i nformat i ona l reference document; wherein the weighting is determined by the 
importance of a word in the collocations and within the document. 

10. (original) The process of Claim 9, wherein said pre-processed analysis 
30 calculates signatures for Web pages, text tags associated with images, and 

blocks of text. 
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1 1 . (currently amended) The process of Claim 9, wherein said pre-processed 
analysis creates an index for each word from a signature vector for a[[n]] 
i nform a tiona 4 reference document and saves the index, word, text document, and 

5 weight of the word into a database that is used to find text documents that have 
similar signatures. 

12. (original) The process of Claim 9, wherein said pre-processed analysis 
uses the signatures and weights of the words to create sets of documents that 

1 0 have similar signatures. 

13. (currently amended) The process of Claim 1, further comprising the step 
of: 

collecting text documents and multimedia from Web pages across the 
15 Internet using a Web crawler and placing them into said set of i nformat i ona l 
reference documents. 

14. (currently amended) A process for real time analysis of any of text 
and[[/or]] media content in a workflow application and relating information to the 

20 content, comprising the steps of: 

automatically analyzing said content in real time as said content is being 
entered or reviewed by a user; 

wherein said analyzing step analyzes said content for semantic and 
conceptual use; 

25 providing a set of i nformationa l reference documents; 

wherein said informat i ona l reference documents comprise any of text, 
Web, and media documents; 

providing a pre-processed analysis of said informat i ona l reference 
documents; 

30 wherein said pre-processed analysis is an analysis of said i nform a t i on al 

reference documents for semantic and conceptual use; 
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identifying i nformat i ona l reference documents related to said analyzed 
content using said pre-processed analysis; 

wherein said identifying step identifies related i nformat i ona l reference 
documents by finding informational documents that are similar in words, 
5 semantically or conceptually, to the analyzed content; 

providing a user with a description of each identified i nformat i ona l 
reference document; 

accepting user input for selecting an identified informationa l reference 
document; and 

10 displaying the selected identified informat i ona l reference document to the 

user. 

15. (currently amended) A process for real time analysis of media content and 
relating information to the content, comprising the steps of: 
15 extracting metadata from said media content in real time as said content is 

being viewed by a user; 

providing a set of i nformat i ona l reference documents; 
wherein said informat i onal reference documents comprise any of text, 
Web, and media documents; 
20 providing a pre-processed analysis of said i nformat i ona l reference 

documents; 

wherein said pre-processed analysis is an analysis of said i nformat i ona l 
reference documents for semantic and conceptual use; 

identifying informat i ona l reference documents related to said metadata 
25 using said pre-processed analysis; 

wherein said identifying step identifies related informationa l reference 
documents by finding i nformat i ona l reference documents that are similar in 
words, semantically or conceptually, to said metadata; 

providing a user with a description of each identified informat i ona l 
30 reference document; 
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accepting user input for selecting an identified i nformationa l reference 
document; and 

displaying the selected identified i nformat i ona l reference document to the 

user. 

5 

16. (currently amended) The process of Claim 15, wherein a broadcaster 
provides customized informationa l reference documents and specifies their 
relevance to be used by said identifying step. 

10 17. (currently amended) The process of Claim 15, wherein a producer of said 
media content provides customized i nformat i on al reference documents and 
specifies their relevance to be used by said identifying step. 

18. (original) The process of Claim 15, wherein said extracting step creates 
15 metadata for said media content by analyzing said media content if said media 

content does not have associated in-band metadata. 

19. (currently amendedl) An apparatus for real time analysis of any of text 
and[[/or]] media content and relating information to the content, comprising: 

20 a module for analyzing said content in real time; 

wherein said analyzing module analyzes said content for semantic and 
conceptual use; 

a set of informationa l reference documents: 

wherein said i nformat i ona l reference documents comprise any of text, 
25 Web, and media documents; 

a pre-processed analysis of said i nformat i ona l reference documents; 

wherein said pre-processed analysis is an analysis of said i nformat i ona l 
reference documents for semantic and conceptual use; 

a module for identifying i nformat i onal reference documents related to said 
30 analyzed content using said pre-processed analysis; 
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a module for providing a user with a description of each identified 
i nformat i onal reference document; 

a module for accepting user input for selecting an identified i nformational 
reference document; and 
5 a module for displaying the selected identified i nformat i ona l reference 

document to the user. 

20. (currently amended) The apparatus of Claim 19, wherein said identifying 
module identifies related i nformational reference documents by finding 

10 informational documents that are similar in words, semantically or conceptually, 
to the analyzed content. 

21 . (currently amended) The apparatus of Claim 19, further comprising: 

a module for storing descriptors for each i nformational reference 
15 document. 

a module for retrieving descriptions of each identified i nformat i ona l 
reference document from said stored descriptors. 

22. (currently amended) The apparatus of Claim 19, wherein said set of 
20 informat i ona l reference documents are stored in a central storage device. 

23. (currently amended) The apparatus of Claim 19, wherein said pre- 
processed analysis creates a list of words and calculates the frequency that the 
words appear in said set of i nformat i ona l reference documents. 

25 

24. (original) The apparatus of Claim 23, wherein said pre-processed analysis 
translates similar words into the same word. 

25. (currently amended) The apparatus of Claim 19, wherein said pre- 
30 processed analysis generates collocations of words that appear together and 
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calculates the frequency of pairs of words and the frequency of the words 
appearing together in said i nformat i onal reference documents. 

26. (original) The apparatus of Claim 25, wherein said pre-processed analysis 
5 finds relations between collocations to learn their meaning/context. 

27. (currently amended) The apparatus of Claim 19, wherein said pre- 
processed analysis uses a signature algorithm to calculate signatures for blocks 
of text, wherein a signature is a vector of words and their weighting within a[[n]] 

10 i nformat i onal reference document; wherein the weighting is determined by the 
importance of a word in the collocations and within the document. 

28. (original) The apparatus of Claim 27, wherein said pre-processed analysis 
calculates signatures for Web pages, text tags associated with images, and 

1 5 blocks of text. 

29. (original) The apparatus of Claim 27, wherein said pre-processed analysis 
creates an index for each word from a signature vector for an informational 
document and saves the index, word, text document, and weight of the word into 

20 a database that is used to find text documents that have similar signatures. 

30. (original) The apparatus of Claim 27, wherein said pre-processed analysis 
uses the signatures and weights of the words to create sets of documents that 
have similar signatures. 

25 

31 . (currently amended) The apparatus of Claim 19, further comprising: 

a module for collecting text documents and multimedia from Web pages 
across the Internet using a Web crawler and placing them into said set of 
informationa l reference documents. 

30 
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32. (currently amended) An apparatus for real time analysis of any of text 
and/[[or]] media content in a workflow application and relating information to the 
content, comprising: 

a module for automatically analyzing said content in real time as said 
5 content is being entered or reviewed by a user; 

wherein said analyzing module analyzes said content for semantic and 
conceptual use; 

a set of informationa l reference documents; 

wherein said i nformat i ona l reference documents comprise any of text, 
10 Web, and media documents; 

a pre-processed analysis of said i nformational reference documents; 
wherein said pre-processed analysis is an analysis of said i nformat i ona l 
reference documents for semantic and conceptual use; 

a module for identifying inform a t i on al reference documents related to said 
15 analyzed content using said pre-processed analysis; 

wherein said identifying module identifies related i nformat i ona l reference 
documents by finding informat i ona l reference documents that are similar in 
words, semantically or conceptually, to the analyzed content; 

a module for providing a user with a description of each identified 
20 i nformat i ona l reference document; 

a module for accepting user input for selecting an identified i nformat i ona l 
reference document; and 

a module for displaying the selected identified informat i ona l reference 
document to the user. 

25 

33. (currently amended) An apparatus for real time analysis of media content 
and relating information to the content, comprising: 

a module for extracting metadata from said media content in real time as 
said content is being viewed by a user; 
30 a set of informat i ona l reference documents; 
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wherein said informat i onal reference documents comprise any of text, 
Web, and media documents; 

a pre-processed analysis of said i nformat i ona l reference documents; 

wherein said pre-processed analysis is an analysis of said i nformat i ona l 
5 reference documents for semantic and conceptual use; 

a module for identifying i nformat i on a l reference documents related to said 
metadata using said pre-processed analysis; 

wherein said identifying module identifies related i nformat i ona l reference 
documents by finding i nformat i ona l reference documents that are similar in 
10 words, semantically or conceptually, to said metadata; 

a module for providing a user with a description of each identified 
i nform a t i on al reference document; 

a module for accepting user input for selecting an identified i nformationa l 
reference document: and 
15 a module for displaying the selected identified informational document to 

the user. 

34. (original) The apparatus of Claim 33, wherein a broadcaster provides 
customized informational documents and specifies their relevance to be used by 

20 said identifying step. 

35. (currently amended) The apparatus of Claim 33, wherein a producer of 
said media content provides customized i nformation a l reference documents and 
specifies their relevance to be used by said identifying step. 

25 

36. (original) The apparatus of Claim 33, wherein said extracting step creates 
metadata for said media content by analyzing said media content if said media 
content does not have associated in-band metadata. 
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